# 16K long context
## ALP DeepScaleR 1.5B C16K

- **Organization:** SynthLabsAI
- **License:** Apache-2.0
- **Tags:** Large Language Model · Safetensors
- **Downloads:** 333 · **Likes:** 1

ALP_DeepScaleR_1.5B_C16K is trained on top of DeepScaleR-1.5B using the Adaptive Length Penalty (ALP) method, which significantly reduces token usage while maintaining performance.
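Since the card ships Safetensors weights, a standard transformers causal-LM load should apply. A minimal sketch, assuming the checkpoint is published under the repo id `SynthLabsAI/ALP_DeepScaleR_1.5B_C16K` (inferred from the listing, so it may differ):

```python
# Minimal sketch: loading the checkpoint with Hugging Face transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SynthLabsAI/ALP_DeepScaleR_1.5B_C16K"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision keeps the 1.5B model small
    device_map="auto",
)

prompt = "Solve: what is 12 * 17?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```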
## Fathom R1 14B RS

- **Organization:** FractalAIResearch
- **License:** MIT
- **Tags:** Large Language Model · Transformers
- **Downloads:** 404 · **Likes:** 1

Fathom-R1-14B is built on the R1-distilled-14B model and reaches o4-mini-level mathematical reasoning within a 16K context, at a training cost of only $499.
## Phi 4 GGUF

- **Organization:** Mungert
- **License:** MIT
- **Tags:** Large Language Model · Supports Multiple Languages
- **Downloads:** 1,508 · **Likes:** 3

phi-4 is an open-source language model developed by Microsoft Research with a focus on high-quality data and reasoning capability; these GGUF quantizations target memory- and compute-constrained environments.
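On constrained hardware, a GGUF quant can be run locally with llama-cpp-python. A minimal sketch; the quant file name below is hypothetical, so substitute an actual file from the repo:

```python
# Minimal sketch: running a GGUF quantization with llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="phi-4-Q4_K_M.gguf",  # hypothetical quant file name
    n_ctx=16384,       # context window
    n_gpu_layers=-1,   # offload all layers to GPU if one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```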
## Aya Vision 8B

- **Organization:** CohereLabs
- **Tags:** Image-to-Text · Transformers · Supports Multiple Languages
- **Downloads:** 29.94k · **Likes:** 282

Aya Vision 8B is an open-weights 8-billion-parameter multilingual vision-language model that supports visual and language tasks in 23 languages.
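A minimal image-to-text sketch, assuming a recent transformers release that ships `AutoModelForImageTextToText` and that the checkpoint is published as `CohereLabs/aya-vision-8b` (repo id inferred from the listing); the image URL is a placeholder:

```python
# Minimal sketch: multilingual image captioning with Aya Vision 8B.
import torch
from transformers import AutoProcessor, AutoModelForImageTextToText

model_id = "CohereLabs/aya-vision-8b"  # assumed repo id

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, device_map="auto", torch_dtype=torch.float16
)

messages = [{
    "role": "user",
    "content": [
        {"type": "image", "url": "https://example.com/cat.jpg"},  # placeholder
        {"type": "text", "text": "Describe this image in French."},
    ],
}]

inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)

output = model.generate(**inputs, max_new_tokens=100)
# Decode only the newly generated tokens, not the prompt.
print(processor.tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
))
```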